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Abstract 





Self-organizing maps (SOMs) are non-supervised neuronal networks based on several biological principles. This 


type of network has been found to provide good performance for several pattern recognition networks, including sound 


and image applications. Based on multiset concepts, the common product and respective correlation has been found to 


typically outperform extensively employed methods such as the cosine similarity, inner producte and cross-correlation. 


In the present work, we bring these two research areas together by developing a self-organized network whose neurons 


incorporates the common product instead of the inner product as a model of synaptic transduction and integration. It 


is shown that this type of obtained neuronal networks present several interesting characteristics including partitioned 


regions with more uniform areas as well as enhanced cluster separation. 


“Soap bubbles, those masters of space.’ 


LdaFC 


1 Introduction 


Though it often goes not realized, pattern recognition 
(e.g. |1, 2, 3]) underlies a good deal of human (as well 
as other living beings) activities which include but are 
by no means limited to: reading these lines, hearing a 
distant bird song, and smelling springtime. Virtually ev- 





ery of these perceptions (e.g. [4, 5]) are achieved through 
intricate neuronal systems perfect along millions of years. 

Though the ubiquity of patter recognition may not 
be so often acknowledge, its importance is nonetheless 
strongly reflected on the continuing and intense efforts 
aimed at achieving effective machine learning correlates. 
Though dating back to William James’s studies in the 
the XIX century, the important area of neuronal pattern 
recognition underwent important developments more re- 
cently in the deep learning initiative (e.g.[6, 7]). 

One of the neuronal models that have played an im- 
portant role in patter recognition, including respectively 
to deep learning, is the Kohonen’s self-organizing map ap- 
proach (e.g. [8]), which was strongly motivated by findings 
from neurobiology and neuroscience (e.g. [5, 4], including 
not only at the neuronal cell level, but also in the way 





in which these cells organize themselves topographically 
along the mammals cortex so that nearby neurons tend to 


have similar operation, and in the ultimate consequences 
of resulting neuronal modules associated to specific pat- 
terns, which accounts for the self-organized map — SOM 
— name. 

In their simplest form, these neuronal networks are 
characterized by having its input layer connected directly 
to single ‘cortical’ layer, where linear neurons are dis- 
tributed uniformly. Each neuron performs an inner prod- 
uct between its current weights and the input pattern. 
Therefore, the neuronal output can be understood to re- 
flect the similarity between the input pattern and the cur- 
rent weight values. 

The learning procedure goes as follows. For each pre- 
sented stimulus, the weights modification takes place 
around the cell presenting the highest response. ‘These 
modifications are often performed through a simple rule 





taking into account the vector diference between the input 
pattern and the weights, while also taking into account a 
learning rate a and the distance between the neighboring 
cells and that which presented the highest output. ‘The 
further away a cell is from the winning cell, the less in- 
tense the weight correction is performed. The repetition 





of this learning stage, in presence of possible relaxation of 





the learning parameters, has been verified to yield parti- 
tions of the cortical layer into portions corresponding to 
the types of presented patterns. 

Though the inner product has been extensively adopted 
in neuronal networks as the means to quantify the simi- 
larity between each pattern and the weight distribution, 


there is a virtually infinite number of other possible simi- 
larity approaches. More recently [9, 10, 11], the classic 
Jaccard index [12], whose application was largely lim- 
ited to binary or categorical data, was adapted to take 
into account real, possibly negative values, as allowed 
by the consideration of multiset concepts and methods 
(e.g. 113, 14, 15, 16, 17, 18]) . Since this index has been 
shown not to take into account the relative interiority 
between two sets [9], a respective enhancement was pro- 
posed [12] that consists of multiplying the real-valued 
Jaccard index by the interiority index, giving rise to the 
coincidence index |9, 10, 11]. The application of these 
indices to traditional important problems such as tem- 
plate matching pattern recognition [19], complex network 
representation and modularity enhancement [20], as well 
as clustering [21], has indicated that the aforementioned 
multiset-based similarity approaches have enhancement 
for pattern recognition thanks to their increased sensi- 
tivity and accuracy underlain by a respective more strict 
quantification of pairwise similarity between virtually any 
mathematical structure, including sets, multisets, vectors, 
functions and fields. 

In addition, complementary studies [22] indicated that 
the multiset-based similarity indices have enhanced per- 
formance also when the traditional inner product tradi- 
tionally adopted as the means for modeling and perform- 
ing synaptic transduction and integration. In particular, 
individual artificial neuronal cells employing the multiset- 
based approaches, and especially the coincidence index- 
based common product, have substantially improved ca- 
pabilities for achieving accurate, strict and specific recog- 
nition of image patterns 

In the present work we take a step further and inves- 
tigate the performance of the multiset-based approaches 
respectively to whole ensembles of neurons, or neuronal 
networks, while focusing on the SOM approach. 

We start by briefly presenting the multiset-based sim- 
ilarity indices, and then proceed to illustrating their en- 
hanced potential when applied to a SOM network. The 
results obtained so far indicate that, when compared to 
the traditional inner used for product learning rule and 
recognition, the multiset-based approaches tend to yield 
more balanced partitioning of the cortical space, as well 
as increasingly separated frontiers between the resulting 
modules associated to the input patterns. These abilities 
are understood to be a direct consequence of the enhanced 
capability of the multiset-based similarity measurements, 
including the common product, respectively to more strict 
and accurate similarity quantification. 


2 The Common Product and Cor- 
relation 


We start with the classic cross-correlation between two 
functions f(x) and g(x): 


Corfo) = f "e-e 


The basic Jaccard index between two sets with non- 
negative multiplicity can be defined as: 


= TAUB (2) 


where A and B are any two sets to be compared. It 
can be verified that 0 < J (A, B) < 1. 
The homogeneity or interiority index can be expressed 





as: 
IAN B| 


= (3) 
min {|A| , |B|} 

with0 < T(A,B) < Land0 < J(A, B) < H(A, B) < 1. 
The combination of the basic Jaccard and the interior- 


T(A, B) = 


ity indices yields the coincidence index, proposed in [9]: 


C(A, B) = 7(A, B) IAB) (4) 





The Jaccard index extended to multisets with non- 





negative multiplicities can be expressed as: 
N : 

> j—1 min (a;, bi) 
N 

> j—1 max (aj, bi) 


with 0 < Ju (A, B) <. Therefore, this index is capable 
of taking into account the multiplicity of the elements in 


Ju (A, B) = (5) 


the involved multisets. 

The extension of the Jaccard index to real, possibly 
negative-valued multiplicities, involves the following de- 
velopments [9, 10]. 

In particular, the possibility to have positive and/or 
negative multiplicity values now requires the application 
of a binary operator analogous to the inner product in 
function spaces, in which the multiplicities are properly 
mirrored among the four quadrants depending on their 





signs [9, 10]. This operator, which will be referenced here 





as common product, can be expressed as: 


CO 


spsg min(ss f(z), sgg(2))de (6) 


< f(x), g(x) >= | 
— CoO 
This binary operator (in the sense of taking two ar- 
guments) corresponds to the direct counterpart of inner 
product between multisets with non-negative multiplici- 
ties. 
We shall also adopt the following operator: 


f(x) o g(x) = f max(sş f(x), sgg(x))dx (7) 


From the above results, the multifunction convolu- 
tion [9, 10] (mconvolution) of two functions can now be 


derived: 


FOM) = | 


— OO 


CoO 


K f(xjgiy—x)>dxr_ (8) 





Similarly, we derive the multifunction correlation, or 
mcorrelation as: 


FD yl = | 


— OO 


CO 


< f(x)g(a—y)>dx (9) 
It is also possible [10] to define: 


F(x) ® 9(e) fol = f 


— OO 


CO 


[f(x) o g(a — y)| dx (10) 
We also have: 


Ap= | |fle)lae (11) 
Ay= [Iola (12) 


Now, the multifunction correlation, when normalized 
by the above function, yields: 
fo X F(x), glz — y) > dx 
[ox F(@) © g(a — y)da 


observe that the integrand corresponds to the gener- 


f(a)Og(2) [y] = 


alized Jaccard index adapted to cope also with negative 
multiplicities, here called negative multiset Jaccard index. 

The interiority index also needs to be adapted to pos- 
sibly negative multiplicities. First, we make: 


. min(sr f(x), sgg(x))dx (14) 


< f(x), g(x) >4+= 
where S4} = {x|s¢(x)Sq(x) > 0}, and then we can write: 


LSO) > 
T = 15 
Foha) = ag (15) 
The respective combination with the interiority index 
yields the coincidence index |9, 10], written as: 


Cn (f(x), 9(@)) = In(F(@), 9(@)) TF), 9(")) (46) 


Now, when performing template matching between two 
functions, it may be interesting to consider the interiority 
also when s¢(x)s (x) < 0. In this case, it is possible to 
take the absolute value of the object function as argument 
of the interiority index above, which is the case considered 
henceforth in this work. 

Each of these indices lead to respective multifunction 
convolutions and correlations, which involve sliding one 
function with respect to the other while calculating the 
respective index, followed by the respective integration. 
Actually, many neuronal networks including the SOM and 





other deep learning architectures, can be understood as 
being based on the convolution or correlation between the 
stimuli and the cells synaptic weights. 


3 Common Product-Based Self- 
Organizing Maps — CPSOM 


The common product-based self-organizing map — CP- 
SOM — model can be understood as a SOM in which 
the inner product has been replaced by some common 
product-based similarity index, including the real-valued 
Jaccard and coincidence indices. All other aspects, in- 
cluding stimuli presentation and training, are shared with 
the traditional SOM approach. 

In this section we consider the real-valued Jaccard and 
coincidence indices, as well as the classic inner product- 
based cross correlation method. 

The input consisted of guassian paterns placed at three 
distinct positions in the input space (or retina), as illus- 
trated in Figure 1. Both the input (‘retina’) and neuronal 
(‘cortex’) layers have dimension 20 x 20. 
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Figure 1: The four types of patterns considered for training the 
CPSOM were obtained from these four gaussians by addition of 
symmetric uniform noise and small spatial displacements of each 
pattern. 


Several instances of these three types of patterns were 
then obtained by adding symmetric noise and positional 





displacements of each stimuli before respective presenta- 
tion to the CPSOM architecture. The following results 
were obtained for 100 presentations, with the following 
learning rule: 


wE = wll + x E wt a e`’ (17) 


where d is the distance from the current cell to the re- 


spective winning cell at a given epoch t. We have adopted 
a = 0.2 and 6 = 0.1. 
around each winning cell consists of those neurons that 


The neighborhood considered 


are no further apart than R = 10. 
After training, each cell was associated to the noiseless 
version of the pattern to which it produces the highest 





response, therefore defining the partition of the cortex 
space. We also considered an additional result to be vi- 
sualized corresponding to the intensity of the maximum 
response of each of the cells to the associated noiseless 
pattern. 

Figures 2, 2, and 2 illustrates the obtained partitioned 
cortex as well as the respective intensity plots respectively 
to the adoption of the real-valued Jaccard, coincidence 
and classic correlation similarities 


The result obtained for the coincidence method 
presents partitioned regions not only occupying more sim- 
ilar areas, but also better separated in the intensity plot. 
The second best approach was that based on the real- 
valued Jaccard index, followed by that adopting the clas- 
sic cross correlation between the weights and stimuli. 

In addition to these possibly enhanced features regard- 
ing the resulting properties of the partitioned cortex, 
it would also be reasonable to expect that subsequent 
pattern recognition will be performed in a more strict 
and accurate manner by the coincidence-based CPSOM, 
given that this index has shown enhanced capabilities 
when applied to other pattern recognition related prob- 
lems [19, 21, 20, 22]. 

FlIgure 5 presents the resulting cortex and intensity plot 
obtained by using the coincidence-based CPSOM, but 
with the minimum part of the Jaccard sub-index being 
taken to the second power, as suggested in [9]. An even 
better separation of the clusters can be observed. 


4 Concluding Remarks 


Effective, generic pattern recognition constitutes one of 
the greatest challenges that motivates much of current sci- 
ence and technology, given the ubiquity and importance 
of this ability for many human activities. In particular, 
significant recent results in the area of deep learning have 
allowed several difficult problems to be effectively tack- 
led and solved. A good deal of these advances have been 
motivated and relied on concepts and advances from the 
closely related area of neuronal networks (e.g. [23]). 
Oftentimes, the synaptic transduction and integration 
by individual neuronal cells have been modeled and/or 


implemented artificially in terms of the classic inner prod- 
uct. In the present work, we set out at investigating 
the potential of alternative mutiset-based measurements 
of similarity as substitutes for the inner product in the 
important neuronal network known as SOM. 

We implemented a relatively small CPSOM based on 





the multiset-derived real-valued Jaccard and coincidence 
indices, as well as on the classic inner product-based clas- 
sic cross correlation. Four types of patterns, subject to 
respective modifications by symmetric uniform noise as 





well as by small positional displacements, were then fed 
into each of the considered CPSOMs architectures. ‘The 
results, evaluated visually from the obtained cortical par- 
titions and intensity plots, suggest that the multiset meth- 
ods, especially that based on the coincidence index, tend 
to yield more balanced partitions, in the sense of the re- 
In ad- 
dition, more markedly separations can ben obtained in 


spective regions presenting more similar areas. 


the intensity plot obtained by using the coincidence in- 
dex method. These results concern the learning stage, 





but Enhanced performance could also be expected when 

applying the obtained maps to recognize new patters. 
Though our discussion has been based on a specific ex- 

periment reported in Section 3, more experiments (not 





shown) have been performed that tend to corroborate the 
observed advantages of the coincidence index-based CP- 
SOM. Additional studies are being conducted incorpo- 
rating more objective performance quantifications. other 
types of stimuli, as well as cortex sizes, and results should 
be communicated opportunely. 
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Figure 2: The real-valued Jaccard-basedCPSOM ‘cortex’ partitioned into the four types of patterns (a), as well as the respectively maximum 


intensities (b) obtained for the associated patterns. A good separation of the pattern types has been obtained, though being a little blurred. 
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Figure 3: The coincidence-based CPSOM ‘cortex’ partitioned into the four types of patterns (a), as well as the respectively maximum 


intensities (b) obtained for the associated patterns. 
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Figure 4: The inner product-based CPSOM ‘cortex’ partitioned into the four types of patterns (a), as well as the respectively maximum 


intensities (b) obtained for the associated patterns. Not only the obtained patter regions occupy markedly different areas, but also the 


pattern separations in the intensity plot are much weaker and blurred than obtained for the coincidence CPSOM or even the real-valued 


Jaccard CPSOM. 
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Figure 5: The partitioned cortex and respective intensity plot obtained by the coincidence-based CPSOM, but having the minimum operation 


(numerator) of the involved real-valued Jaccard squared. In addition to partitioned regions with similar areas, an even better separation of 


the clusters can be observed. 
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